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Objectives 


The principal objective of this unit is to establish some general important 
results which will form the basis of applications to be considered in later 
units. 


After working through this unit, you should be able to: 
(i) explain the meanings of the terms: 
dimension of a vector space, 
morphism from one vector space to another, 
kernel of a morphism, 
matrix, 
identity matrix ; : : : 
(ii) understand the proofs of the theorems about morphisms of vector 
spaces which are listed in section 23.1.6; 
(iii) add and multiply (suitable) matrices; 
(iv) form a scalar multiple of a matrix. 


Note 


Before working through this correspondence text, make sure you have 
read the general introduction to the mathematics course in the Study 
Guide, as this explains the philosophy underlying the whole course. 
You should also be familiar with the section which explains how a text 
is constructed and the meanings attached to the stars and other symbols 
jn the margin, as this will help you to find your way through the text. 
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Linear Equations 
23.2.1 


An Algebra of 
Morphisms 
23.1.5 


Matrices Linear Algebra 
29.2:2 Units 26 and 28 


Matrix 
Multiplication 
23.2.3 


Geometrical 


Applications 
23.2.4 


Addition and Scalar 
Multiplication of 
Matrices 


23.2.5 


Glossary 


Terms which are defined in this glossary are printed in CAPITALS. 


EQUALITY OF 
MATRICES 


IDENTITY MATRIX 


INVARIANT SET 


LINEAR EQUATION 


LINEAR => 
TRANSFORMATION 


MATRIX (m x n) 


MATRIX ADDITION 


MATRIX 
MULTIPLICATION 


SUBTRACTION 


MORPHISM 


MULTIPLICATION OF - 
A MATRIX BY A 
SCALAR 


NULL SPACE 


SUBSPACE 


Two MATRICES are EQUAL if they have the same 
numbers of rows and columns, and the correspond- 
ing elements are the same. 


The IDENTITY MATRIX is a MATRIX with the same 
number of rows as columns, in which all the 
elements on the diagonal from the top left-hand 
corner to the bottom right-hand corner are l's, 
and all other elements are 0’s. 


An INVARIANT ELEMENT under a function f is an 
element a of the domain of f such that f(a) = a. 


An INVARIANT SET under a function f is a subset A 
of the domain of f such that f(A) = A. 


The KERNEL of a MORPHISM is the set of elements 
in the domain which map to the zero element in 
the codomain of the morphism. 


A LINEAR EQUATION is an equation of the form 
1X1 + 25x25 + <- + a,x, = 0, 
where the a’s are constants. 


A LINEAR TRANSFORMATION is a MORPHISM from one 


vector space to another. 


A MATRIX (m x n)isarectangular array of numbers, 
having m rows and n columns. - 


See page 43. 
See page 37. 


See page 43. 


If Lisa mapping from a vector space V to a vector 
space U such that 


L(x,v, + &202) = o,L(t) + aL(v;), 
for any a, , 4; € R, v,, v; € V, then Lis a MORPHISM 


from V to U. 


See page 43. 


The NULL SPACE of a MORPHISM is the KERNEL of the 
morphism. 


If S is a subset of a vector space U, and S is itself a 
vector space, then S is a SUBSPACE of U. 
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Notation 
i, j,k 


R" 


Ry 


~Œ 


Geometric vectors of length one in the directions of 
the x, y and z Cartesian axes respectively. 


The set of all n-tuples of real numbers, i.e. the 
Cartesian product, 
RxRx--xR. 
AZ 
nterms 
The function f, considered as an element of a 
vector space. 


A temporary symbol for the operation of addition 
on a vector space V. 


The zero element of the vector space V. 


(1,0,...,0) 
(0, 1,..., 0) 

. These n-tuples form a basis for A". 
(0,0,..., 1) 


A matrix with two rows and three columns. 


A temporary symbol for the operation of multi- 
plication of two matrices A and B (in that order) 
where B has just one column. 


A temporary symbol for the operation of matrix 
multiplication. 


The matrix corresponding to the mapping which 
rotates the plane about the origin through an angle 
0 anti-clockwise, i.e. 


cos 0 $m 


sin 0 cos 0 


The identity matrix. 


A temporary symbol for the operation of matrix 
addition. 
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23.1 MAPPINGS OF VECTOR SPACES 23.1 
23.1.0 Introduction 23.1.0 
In Unit 22, Linear Algebra I we considered the set of geometric vectors, Introduction 


and defined the terms linear dependence and independence, dimension 
and basis. We then extended our use of these terms from a geometric 
context to an algebraic one: we defined a vector space. The examples of 
vector spaces given in Unit 22 are interesting because, although they look 
very different, they all belong under the same umbrella. However, just 
looking at examples does not give us any extra equipment for solving 
problems. In this unit we shall establish some general results for vector 
spaces. 


You will recall that in Unit 71 we began by mapping sets one to another, 
and we introduced the concept of a function. With this concept we found 
that we could think about problems which are more sophisticated and 
have wider application than simple arithmetic calculations. It is the same 
with vector spaces ; the topic becomes richer and more interesting when 
we introduce mappings from one vector space to another. 


23.1.1 Mapping One Vector Space to Another 23.1.1 


Some mappings of vector spaces to vector spaces are simply equivalent 
to a change of notation. 


Example 1 ; Example 1 


We know that the set of geometric vectors forms a vector space, as does 
the set of all lists with three elements. 


Ify = xi + yj + zk, then the mapping 
x 
niy— |y], 
Z| 


with domain the set of geometric vectors, simply gives us an alternative 
notation. 


(x,y,z) 
hee tee y A 


(i, j and k are the unit geometric vectors in the directions of the x, y and z 
Cartesian axes respectively : see Unit 22, Linear Algebra I.) 
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Under this mapping we have 


1 


n:i-—>10 


n:k-—>| 0 
1 


We can calculate n (y) for any geometric vector y once we know the images 
of the base vectors. mu 


Notice that if we want a mapping to define a new notation for the elements 
in its domain then the mapping must be one-one. 


Another “notational” mapping is the mapping of ordered lists to ordered 
pairs defined by 


x 
| | ==) 
y 


or ordered lists to ordered triples defined by 


x 
1167, 2) 


Z 


which simply states the obvious fact that, given a co-ordinate system, 
we can represent a two-element list by a point in a plane and a three- 
element list by a point in three-dimensional space. We already have a 
name for the set of all ordered pairs of real numbers; we call it R x R. 
This is usually shortened to R?. R? is used to denote the set of all ordered 
triples of real numbers, and R" is used to denote the set of all ordered 
n-tuples of real numbers. We have seen that R", with addition and mul- 
tiplication by a scalar defined in the usual way, is a vector space of dimen- 
sion n. We shall refer to this vector space as R", leaving the operations to 
be understood. 


Example 2 Example 2 


Consider the mapping of R? to R? defined by 
Fx, y) —9À (2x, 2y). 


What effect does this mapping have? One way to start to answer this 
question is to consider what happens to a few particular elements. 


Take the element (1, 1); then f(1, 1) — (2, 2). 


2 e (2,2) 
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For the element (0, 1), we have f(0, 1) = (0, 2). 


(0,2) 
(0,1) 


In general, we see that the point P is mapped on to the point P' in the 
same direction away from the origin O, where OP’ = 20P. (Does any 
point remain unchanged?) 


Consider a set of elements in the plane, such as 
fix. px eye 


These points lie on a circle of unit radius, and therefore their images 
under this mapping will lie on a circle of radius 2. We can verify this 
algebraically as follows. Suppose (x, y) (u, v), then u = 2x and v = 2y, 
and so if x and y satisfy the equation 


xy =4, 


u and v must satisfy the equation 


eft 


ie. u? +0 =A 
Thus : 
(G5 y):x? + y? = 11 —9 {(u, v):u? + v? = 4} (See RB4) 
= (05 y):x? + y? = 4} 


(We have re-written this set in terms of x and y so that we can draw the 
following diagram.) 


— 


Example 3 Example 3 
Consider the mapping of R? to R? defined by 
(x, y) — (=y, x). 


Let us have a look at what happens to the base vectors (1, 0) and (0, 1). 
We have 


(1,0) —> (0,1) and (0, 1)—(—1, 0). 


y y 
(0,1) (0,1) 
(1,0) * C10) x 


The mapping has the effect of rotating the base vectors through an angle 
T 3 = == 
3 counter-clockwise about the origin, and this is indeed the effect on the 


entire plane. 


In this example, any circle centred at the origin maps onto itself. Every 
point of the set {(x, y):x? + y? = 1} moves (for example, (2, $——>(—£, 9), 
but the set itself remains unchanged. E 


Example 3 provides a particular example of a simple but important 
mathematical concept. Given a function f of a set A to itself, then if, 
for instance, 


fa) =a (acA), 
we say that a is an invariant element under f. 


If the set S as a whole maps to itself, then S is called an invariant set under f, 
even if some or all of the elements of the subset S of A are moved under f. 
In the above example we have an invariant circle, i.e. an invariant set of 
points. The concept of invariance can be even more general. For example, 
under a translation of the plane, the distance between two points is 
invariant ; that is, if P and Q map to P' and Q' respectively, then the length 
of PQ = length of P'Q'. Invariance is a very important and general 
concept in mathematics, and we shall have more to say about it in the 
remainder of this course and subsequently. In fact, we encountered 
invariance very early in the Foundation Course. When we are asked to 
solve an equation f(x) — 0 and we choose to do it iteratively, then we 
usually rearrange the equation in the form: 


x = F(x); = 
that is, we look for the invariant elements under the function F. (See 
Unit 2, Errors and Accuracy.) 
Exercise 1 
Under the mapping 

f(x, y) —> (2x, 2y), 
in Example 2, we have seen that circles centred at the origin are not 
invariant. But some lines are invariant. Which lines? m 
Example 4 
Consider the mapping of R? to R? defined by 

(x, y)—*(—-». y). 

From the previous two examples, it seems that, to see the effect of the 
mapping, it is probably worth seeing what happens to a set of base vectors. 
Again, to simplify the arithmetic, we choose the base vectors (1,0) and 
(0, 1). We find that 

(1, 0)— (0, 0) 


(0, E 1, 1), 
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1 


LIS 


Definition 2 


Exercise 1 
(1 minute) 


Example 4 


so that in terms of the corresponding geometric vectors j and j, we have 
that j maps to — i + j, but į maps to the zero vector. 


(03) fi, 


We shall look at this mapping in a little more detail. What happens to the 
x-axis? On this axis y is zero and so every point on the x-axis maps to 
(0,0): the entire x-axis shrinks into the origin. What about the line 
y = 1? Every point on this line maps to the point (— 1, 1). 


In fact, the entire plane maps on to the line whose equation is 
x+y=0 


In terms of geometric vectors, every element in the image set can be 
represented in terms of the vector — i + j. 


The image set has dimension 1, and so the effect of the mapping is to 
“lose” a dimension from our vector space. This is equivalent to saying 
that this mapping is many-one. If we start with a point, P say, in the plane 
and map it to a point Q on the line x + y = 0, then we cannot map back 
to the original point P. This is because the point Q on the line x + y = 0 
corresponds to the whole of the line parallel to the x-axis through P ; this 
line is the equivalence class containing the element P. (In Unit 19, Rela- 
tions we showed that any many-one mapping defines an equivalence 
relation on its domain. This is an example. The relation p is (a, b) p (c, d) 
if b = d.) 3 
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(continued on page 6) 


| 


Solution 1 
Straight lines which pass through the origin. E 


(continued from page 5) 


We have chosen these geometric examples to give you a visualization of 
the sort of mappings we are going to consider. One of the pleasant features 
of linear algebra is that by considering a geometric situation we can often 
throw light on non-geometric situations (and vice versa). Thus, a non- 
geometric analogue of the last case, where we “lost”? a dimension, is 
provided by the following example. 


Example 5 


Let P, be the vector space of all polynomial functions of degree 3 or less. 
(This space has dimension 4, and a set of base vectors is given below.) 
The differentiation operator: 


D:pr— Dp (p € P3) 


maps the space P, to the space P, (which has dimension 3). In this case 
we could take as a basis for P; the set of functions: 


le (x e R) 
iXt——X xeR 
h Basis for P, 
exe (x e R) 
A P (x e R). 


This set of functions maps to the set 


Eo 0 (x e R) 
dixil (x e R) 
d XE >25 (x e R) Basis for P, 


fuxe—— (ER. 


Like the previous example, this mapping is many-one. (For example, 
all the functions of the form: 


fx—x+a  (xeR) 
where a e R, map to the function f.) 


Note that fo is the zero vector in P}. It cannot belong to any basis for P}, 
since a basis must be a linearly independent set of three vectors. For 
consider the set { fo, g, h}, where g, h e P,. Since 


afo + Og + Oh = fo, 
where a is any non-zero real number, we see that any set of three elements 


containing f is linearly dependent. H 


Exercise 2 


We seem to be pinning a lot of faith in choosing a convenient basis. Is the 
choice of basis important? We shall resolve this difficulty later, but one 
point can be considered here. 


In Example 5, instead of f; and fı, we could choose go and g,, where 
Ts (x e R) 
A (x e R), 


and then none of the base vectors maps to the zero vector under D. Is 
(go. gi. fh} a basis for P,? a 
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Solution 1 


Discussion 
* * 


Example 5 


Exercise 2 
(2 minutes) 
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Exercise 3 Exercise 3 
(2 minutes) 


Fill in the gaps in the solution to the following problem. 


Problem 
If Tis a mapping from R? to R?, and 


ny (33), 


describe the image of the ellipse 
(65 y):9x? + 4y? = 36). 
Solution of Problem 
x y 
Put u = = and v = — 
utu —- and v 3 
Since x and y satisfy the equation 


9x? + 4y? = 36 


u and v satisfy the equation 


(i) 


(i) 


(iii) 


Exercise 4 Exercise 4 
z 3 z (4 minutes) 
(i) By choosing a suitable basis and finding its image, or otherwise, 


describe the effect of the following mappings of R? to R?. 
(a) T, :(x, y) — (y, x) 

Rp ud 

FIFI Sf 


(ii) Find an element in R? which is unchanged under the mapping T,. Ni 


(b) T; :(x, y) —> 
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Solution 2 Solution 2 


ghix-— 1 (xe R) 
gi:xt—3-—1 (x € R). 


Although none of the base vectors is mapped to the zero vector, 
(go; £i f 3} 1 is linearly dependent, since 


igo (— Jg; + 0f5 = fo; 


£o and gi cannot both belong to the same basis because one is a scalar 
= 2 = other. - a 


Solution = . Solution 3 
(i) Qu)? + 43v) = 36 
(ii) u? + 0? = == 

Gii) ((u, v):u? + v? = y 
(iv) the origin; 1 

Thus the effect of the mapping is to transform the ellipse to the circle as (See RB4) 
shown below. 


- 


Solution = s E : Solution 4 
(i) If we choose the basis {(1, 0), (0, 1)), we have: 


(a) T, :(1,0)——(0, 1) 
T, :(0, 1)—— (1, 0) 


The effect of T, is to reflect the points of the plane in the line with 
-equation y = x (we are = interchanging the x and y co- 
ordinates). — 


"sae A 
E 

1 | 
ec e 


(0,1) 


The effect of T, is to rotate the plane about the origin through an 
angle = (If you are not convinced, find the images of some more 


points.) 

(ii) Any scalar multiple of (1, 1) is an invariant element under T, . In terms 
of geometric vectors, any vector in the direction of the line with 
equation x = y is invariant. All such geometric vectors are scalar 
multiples of į + j. m 


23.1.2 Morphisms 


There are many interesting and useful results concerning mappings of 
vector spaces, but the most fruitful field of study consists of those mappings 
which are homomorphisms or isomorphisms (and therefore, of necessity, 
functions). 


If Vis a vector space, the two operations we have defined on the elements 
of V are addition of vectors and multiplication of a vector by a scalar. 
Suppose a function T maps a vector space V, with an addition operation 
+,, to a vector space U, with an addition operation +,. The additive 
structure will be preserved if, for any vectors v, and v, in V, 


To, *,2:)— T(v,) +, T(v5) 


Since we have been abusing the symbol + throughout this unit (we have 
defined all sorts of methods of addition and called them all +), we shall 
continue to do so, and we drop the suffices u and v from the addition 
symbols. We then have 


T(v, + v3) = T(v4) + Tx) 
as the condition that T should be a morphism for the addition operations. 
For the other operation we require that 

T(av) = aT(v), 
for any real number « and any vector ve V. 


Equations (1) and (2) are the conditions that T should be a morphism 
from the vector space V to the vector space U* 


The two equations can be combined to give the following equation: 
T(a,v, + &202) = o, T(v,) + a, T(v5), 
for any real numbers «,,«,, and any vectors v, and v, e V. 


In many books, a mapping of a vector space to a vector space is called a 
transformation, and when the mapping is a morphism it is called a linear 
transformation. This is another example of calling a particular type of 
mapping by a special name. (We have already used the word operator.) 


* Previously (see Unit 3, Operations and M orphisms, page 25), we said that a morphism fwasa 
mapping from A to f (A). Here, T(V) may bea proper subset of U. (In fact, if Tisa morphism, 
T(V) is a vector space, but we have not proved this yet.) 
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23.1.2 


Discussion 
Y ok 


Equation (1) 


Equation (2) 


Equation (3) 


Definition 1 
* 


Exercise 1 


Which of the following mappings are morphisms? (Take the operations 
in the various vector spaces to be the usual ones.) 


(i) The mapping of R? to R? such that 
T:(x4, x3). (x5. x1) 

(ii) The mapping of R? to R? such that 
T:(x,, x2) —9 (x4, x2) 


(iii) The mapping of the set of all geometric vectors in a plane to R such 


that 
Tx — aX 


=> 


where ais a given geometric vector, and the dot stands for the inner 
product as defined in Unit 22. 

(iv) The mapping of the set of all polynomial functions of degree n or less 
to itself such that 


T :p—>the derived function of p. 


(v) The mapping of the set of all real functions, which are twice-differ- 
entiable at all points in A, to the set of all real functions, such that 


T: f —92D?f + Df + 3f e 


Exercise 2 
Let L be a morphism from a vector space V to a vector space U. 


Complete the gaps in the proof of the following theorem. 


THEOREM 


If the zero element of V is v, (i.e. Uo is the element for which v + to = v 
for any ve V), and if uj is the zero element of U, then L(vo) = uo. 


PROOF 


Since vV + Vvo =D 


L(v + vo) = e y] (a) 


But L is a morphism, so 


AS 0 


From (i) and (ii), Lv) + L(vo) = L(v), 


so subtracting L(v) from both sides, we see that 


L(vo) is the element of U. (c) 


Confirm this result for each of the mappings which are morphisms in 
Exercise 1. E] 


Exercise 2 shows that under a morphism the zero element in the domain 
vector space is mapped to the zero element in the codomain vector space. 
We shall use this result in proving the following theorem, which is funda- 
mental to the study of morphisms of vector spaces. 


10 
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Exercise 1 
(3 minutes) 


Exercise 2 
(4 minutes) 


Main Text 
wk 


THEOREM 


If Lis a morphism from a vector space V to a vector space U, then I(V) 
is a subset of U which is itself a vector space. 


(L(V) may be U or a proper subset of U.) 


L(v) 


METHOD OF PROOF 


We have to prove that the set L(V), with the operations of the vector space 
U, satisfies the vector space axioms listed in Unit 22. To save you the 
trouble of referring back, we list the axioms for a vector space V; ou, 
7,,v, are any elements of V, and wand $ are any real numbers: 

Y, + v € V and is unique. 

vı + (v2 35 13) = (v, zi V2) + 03 

Uy + 0, = V, + Ui 

There is an element, v,, in V, such that 


AU bt -— 


U + vo =V 


“ape V 

E E 

afv, + v2) = (avi) + (av) 
(x + Pw = av + Bv 

(aB)u = o(Bv) 


l01xv-2v 


NO 0 —) QN tA 


Axioms 2, 3, 6, 7, 8, 9 and 10 are statements about all elements of a vector 
space, and since U is a vector space we do not have to check these axioms 
for L(V). On the other hand, axiom 4 is a statement that a particular 
kind of element (the zero element) belongs to a vector space. Clearly, the 
zero element of U will not belong to every subset of U, so we have to prove 
that it belongs to the particular subset L(V). Axioms 1 and 5 concern 
closure. If L(V) is to be a vector space, then any combination of elements 
in L(V) must give resulting elements still in L(V). This again is not 
necessarily true for any subset of U, so we must check it for L(V). 


PROOF 


We have to prove that axioms 1, 4 and 5 hold for L(V). We have three 
pieces of information: 


(i) V is a vector space; 
(ii) U is a vector space; 
(iii) L is a morphism. 
We have used (ii) to dispose of axioms 2, 3, 6, 7, 8, 9 and 10, but we have not 
yet used (i) and (iii). 


We have proved that axiom 4 holds for L(V): in Exercise 2 we proved 
that under a morphism the zero element v, in the domain V maps to the 
zero element uo in the codomain U. So the zero element of U belongs to 


LV). 


Let us have a look at axiom 1; we must show that L(V) is closed under 
addition. If u, and u, are any elements of L( V), then there are elements 
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Theorem 


* ko 


Discussion 
* 


Main Text 
*o* 


(continued on page 13) 


Solution 1 
(i) T((x1, x3) + 01. y23) = T6a + y1; X2 + Y2) 
= (x2 + Y2,X1 + yı) 
= (x2; X1) + (2. V1) 
= T(x,, x3) + T1, 2), 
and 


T(o(x,, x3)) = T(ax,, 0x2) 
- = (0x2, 0x1) 
= W(X, X1) 
= aT (x1, x3). 
so T is a morphism. 
(ii) aT (x1, x) = o(xt, x3) = (axi. 0x3), 
and 
T(o(x,, x3) = T(ax,, 0x2) = (a?x7, c? x3) 


Equation (2) is not satisfied, so T is not a morphism. 


(ii) T(x + y) = a-(x + Y) 
=q-x +4: y( is distributive over +) 
= T(x) + TQ) 
and 
T(ox) = 4- ox 
= ala - x) 
= aT (x) 


so T is a morphism. 
(iv) T is a morphism | These results follow directly from the properties 
(v) T is a morphism ( of D. [| 


Solution 2 > 


(a) v 

(b) v, Yo 

(c) zero 

For the morphisms of Exercise 1, we have: 
(i) (0, 0) —> (0, 0) 

(iii) 0—0 

(iv) (x= 0) > (a 0) 
(v) (x —9 0) — (x — 0) 


You may have noticed that for many-one mappings, the zero vector is 
not the only vector which maps to the zero vector. For example, in (iv) 
we also have (x> k) mapping to (x+—— 0), where k is any real number. 
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Solution 2 


vı and v; in V such that 
u, = L(v,), 
u2 = L(v;). 
Then 
uy + uz = L(x) + L(v;) 
= Lv, + v5) (because L is a morphism) 
= L(v,), 


where v, is an element of V (by axiom 1 for the vector space V); L(v;) is 
an element of L(V), so u, + u, belongs to L{V), and L(V) is closed. 


The other closure axiom, number 5, is easily checked. 
If 


then 
au = aL(v) 


= L(ov) (because L is a morphism). 


Therefore au e L(V). 
This completes the proof. 


If a subset of a vector space U is itself a vector space, then we call it a 
vector subspace of U. 


Summary 


So far in the text, we have considered mappings of one vector space to 
another, and we have concentrated our attention on those mappings 
which are morphisms. A morphism has the property that the image set 
itself is a vector space : we have seen that the properties of commutativity, 
associativity and the zero element are carried over from the domain to its 
image set by a morphism. 


An interesting feature of some of the morphisms we have met is that they 
map a vector space on to an image set which has a lower dimension. For 
example, we have had mappings of planes to lines, polynomials of degree 
n or less to polynomials of degree n — 1 or less, and so on. Two questions 
arise. What has happened to the “lost” dimensions? Can we predict 
in advance when we are going to “lose” a dimension? We shall look at 
these questions in the next section. 


Exercise 3 
(i) The mapping from R? to R? defined by 
L:(x;, x2) —9 (— x5, x2) 


is a morphism. Prove directly by verifying the axioms that L(R?) is 
a vector space. 
(ii) The mapping from R? to R? defined by 


T :(x1, x3) —* (x1, X3) 


is not a morphism. Show that T(R?) is not a vector space by finding 
an axiom which is not satisfied. 


(Note that we have not proved that if T:V-—> U is not a morphism, 
then T(V) is not a vector space. For a general mapping T, we do not 
know anything about T(V) if T is not a morphism.) E 
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Exercise 3 
(3 minutes) 


Solution 3 


(i) As in the proof of the theorem, we need to check axioms 1, 4 and 5 
only. 
Axiom 1 The elements of L(R?) are of the form 
(—a,a).aeR, 
and 
(—a, a) + (—b,b) = (—(a + b), a + b). 
and so axiom 1 is satisfied. 
Axiom 4 Consider the element (0, 0). 
L((0, 0)) = (0, 0), 
so that (0, 0) e L(R?), and axiom 4 is satisfied. 
Axiom 5 (—a, a) is any element of L(R?) 
o(—a, a) = (— aa, aa), 
and so a(—a, a) e L(R?): axiom 5 is satisfied. 
(ii) The only axioms which may not be satisfied are 1, 4 and 5. Of these 
only 5 is not satisfied. 
a(x1,X3) = (axi, ax>) 


and if « is negative, «x? is also negative, and so cannot be written as 
the square of a real number. [| 


23.1.3 The Kernel 


Let us have another look at Example 23.1.1.4 in which we saw that the 
morphism 


L:q,y)—9(-»y) (y eR?) 


maps the plane to a line. First, we looked at a particular basis, and saw 
that one of the base vectors mapped to the zero element (0, 0) in the codo- 
main. We then investigated the mapping by looking to see what happened 
to particular subsets of the plane. We saw tlrat any line parallel to the 
x-axis mapped to a single point. 


This raises two questions. Firstly, is it significant that we lose one basis 
vector and we lose one dimension? Secondly, it is all very well in this 
simple case to pick out a few significant subsets that tell us such a lot. 
We picked them out because we knew their properties. Consider now the 
images of the lines parallel to the y-axis in the domain. Any such line 
maps to the entire image set, for suppose we take the line for which x — a, 
then 


Liay)— —c3(—»»)- QER). 


y y 
| x N x 
By considering certain subsets of the domain, we find that we can obtain 
information about L. Is there any particular subset which we can most 
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23.1.3 


Discussion 
* * 


profitably consider? That is, can we describe a subset in the domain 
which will give us information about L in a form which we can interpret 
easily? If so, can we extract any general feature which will help us with 
other examples? 


The clue is in our observation about the loss of a basis vector. The vector 
(1, 0) maps to (0, 0), but of course it is not only this vector which “shrinks” 
to zero—every multiple of (1, 0) maps to (0, 0). (Because the mapping is a 
morphism, L(a(1,0)) = aL(1,0) = (0,0).) So a whole set maps to (0,0). 
Why consider this particular set? Try the next exercise. 


Exercise 1 


The vectors (0, 1) and (2, 2) form a basis for R?. Calculate L(0, 1) and 
L(2, 2), where L is the mapping we have been discussing: 


L:(x, y)*—>(—y,y)  ((y)e R?). 
What happens to the linear independence of (0, 1) and (2, 2)? E 


Exercise 1 shows us that, in our original basis, the choice of a vector 
which mapped to (0, 0) was purely fortuitous. (See page 4.) It may so 
happen, as in this exercise, that none of the base vectors maps to (0, 0), 
even though we “lose” a dimension. But here the whole set ((a, 0): 4. € R} 
maps to (0, 0), whether or not one of its elements is in the basis. 


It seems, then, that the set which maps to (0, 0) tells us something about 
the “lost” dimension. In this case the set which maps to (0, 0) has dimen- 
sion 1 (every element of the set can be obtained as a scalar multiple of 
(1, 0), and we lose just one dimension. Let us have a look at two more 
examples, one where we again lose one dimension and one where we lose 
more than one dimension. We shall again take geometrical examples 
because geometrical situations are easy to visualize. 


Example 1 
The mapping 
L:(x, y, z) —9 (x, y, 0) 


is a morphism of R? to R?. The image of any point P is the point at the 
foot of the perpendicular from P to the plane with equation z — 0. Thus 
the domain maps to a plane. 


P(x,y, z) 


(x y,0) 


This morphism maps a 3-dimensional space to a 2-dimensional space: 
we lose one dimension. The set which maps to the zero element (0, 0, 0) 
in the codomain is the set {(0, O, z):ze R}, that is, the z-axis. This set is 
itself a vector space and its dimension is one, the same number as the 
number of “lost” dimensions. m 
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*ox 


Example 1 


(continued on page 16) 


Solution 1 

L(0, 1) = (—1, 1) # (0, 0) 
and 

(2, 2) = (—2, 2) 7 (0, 0), 


but 
—2L(0, 1) + L(2, 2) = (0, 0) 


Although neither vector maps to zero, the pair of linearly independent 
vectors maps to a pair of dependent vectors. So although the original 
vectors form a basis for R?, their images do not. E 


(continued from page 15) 


Example 2 
The mapping 
L:(x, y; z) => 2x; 0) 


is a morphism of R? to R?. The image of the point (x, y, z) depends only 
on its x-co-ordinate. Thus (1, 2, 3), (1, 6, 7), (1, 6, 99) all map to the point 
(1, 2, 0). Every point in the plane with equation x — 1 maps to this point. 


z z 
4 


. 
(1,2,0) 


x x 


Similarly, every point on the plane with equation x = 2 maps to the point 
(2, 4, 0) — and so on. Every plane perpendicular to the x-axis maps to a 
point on the line defined by the equations: 


2x —y 20 
z= 0; 


and the entire three-dimensional space maps to this complete line. 
E z 
x 
x 
LR?) = {(x, y, z):2x — y = 0,z = 0} 


The three-dimensional domain maps to a space of dimension 1. We 
seem to have lost two dimensions. 


< 


Thus 
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Example 2 


Which set maps to the zero element? In this case the zero element is 
(0, 0, 0), and the set which maps to (0,0,0) is the set {(x, y, z):x = 0}, 
ie. the yz-plane; this is itself a vector space and has dimension two. 
Notice that in this simple case we can use the “‘basis argument" again — if 
we can find an appropriate basis. Taking the set of vectors ((1,0, 0), 
(0, 1, 0), (0, 0, 1)} as a basis, we see that both (0, 1, 0) and (0, 0, 1) map to 
(0, 0, 0), and so we “lose”? two base vectors. In fact we “lose” any vector 
which can be expressed as a linear combination of these two vectors 
(i.e. the points in the plane with equation x — 0), because 


L(o(0, 1, 0) + (0, O, 1)) 
= aL(0, 1,0) + BL(0, 0, 1) = (0, O, 0), 
since L is a morphism. | 


We have seen that the subset of the domain which maps to the zero element 
in the codomain plays an important part, so we now give it a name. 


If Lis a morphism of a vector space V to a vector space U, and if uy is the 
zero element in U, then the set 


{v:v e V, Liv) = uo} 


is called the kernel* of the morphism. (Another name which is in common 
use for this set is the null space.) We shall denote the kernel by the letter 
K. 


There is one important point to notice here. In Unit 22, Linear Algebra I, 
we defined a basis of a vector space V to be a linearly independent set of 
vectors in V which spans V. We defined the dimension of V to be the num- 
ber of elements in a basis. Now the kernel of a morphism may only contain 
the zero element of V; i.e. we may have K = (0). 


We write O instead of v here, because you may like to refer to our dis- 
cussion of the vector space {0} in section 22.2.2 of Unit 22. There, we proved 
that 


aQ = 0, 

where «is any real number. This means that (0) is a linearly dependent set ; 
that is, {0} does not possess a basis. We adopt the following definition. 
The dimension of the zero vector space, {0}, is zero. 
The kernel has some quite remarkable properties. We have already 
hinted at two of them which are printed in red below. 
(1) The kernel itself is a vector space. 

We have shown that L(V) is itself a vector space, and in Examples 1 


and 2 we have seen that 
(2) (dimension of L(V)) — (dimension of V) — (dimension of kernel) 


The proof of (2) is given in the Appendix at the back of this text. The proof 
of (1) is not difficult; we have set it as an exercise. 


* This is in accordance with the ordinary use of kernel for the nucleus or core of a structure. 
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Exercise 2 


Find the kernel of each of the following morphisms: 
MT x9 (x1, X2, 0) 

where T maps R? to R?. : 
Gi) T':(x1, x3) 9 (x1 + X2, x, — 2x2) 

where T maps R? to R?. 


In each case find the dimension of the kernel and verify statement 2 in 
the text. (E 


Exercise 3 
Let L be a morphism from a vector space V to a vector space U. Show 
that the kernel of Lis a vector subspace of V. 


(HINT: How many of the axioms of a vector space need proving for the 
kernel? (See section 23.1.2.)) E 


23.1.4 Some Properties of the Kernel 


It is remarkable how much we can tell about a morphism just by consider- 
ing its kernel. 


Let L be a morphism of a vector space V to a vector space U, and let K 
be its kernel. If ke K and ve V, then 


L(v + k) = L(v) + L(k) (Lis a morphism) 
= Liv) + ug (definition of K) 
= L(v) (axiom 4 for U), 


where uy is the zero element in U. So v and v + k, where k is any element 
of the kernel, have the same image. 

Suppose now that we want to find all the elements in V which map to a 
given element u e U, and that we know one such element v, i.e. 


Lv) = u 


Then we know immediately that v + k, for all ke K, are such elements, 
and the remarkable thing is that they are in fact all the elements which 
map to u. We can prove this as follows. Suppose v, e V maps to u, i.e. 
L(v,) = u. Then consider v, + (— 1)v. We have 


L(v, + (— 1)0) = L(vi) + (- 1L(v) (Lis a morphism) 
=u + (—1)u (by hypothesis) 
= Uo (axiom 6 for U). 
But the kernel K contains all those elements which map to uy, so 
vı +(—lv =k, 


for some k, € K. By axiom 1 for V, k, is unique. Adding v to both sides, we 
get 


v =v + k, 


and so v; is of the form v + some element of the kernel. This result has 
important consequences: for instance, if the kernel contains a finite 
number of elements, n say, then we know that exactly n elements of V 
map to any given element of L(V). In particular, if the kernel contains 
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Main Text 
Rh od 


just one element (which will necessarily be the zero element in V), then we 
know immediately that L is one-one, i.e. an isomorphism. The following 
examples apply this discussion. 


Example 1 
Apply the above ideas to finding the solution of the equations 
2x + 3y —z=1 
x+y—-—z=2 


in terms of vector spaces. E 


Solution of Example 1 


One way of expressing the problem is to say that we want to find the set 
of triples (x, y, z) which satisfy these equations. If L is the mapping from 
R? to R? defined by 


L:(x,y,z)—— (2x + 3y — z,x + y — 2) 
then we want to find the set which maps to (1, 2). 


We have seen that we can describe the set which maps to any particular 
element when we know the kernel and one element of the set. In the con- 
text of this example, this means that, if we can find one solution to the 
equations, we shall be able to find all the solutions, simply by adding to 
that solution each element of the kernel. So we have to find one solution 
and we have to find the kernel. 


. One Solution 


If we give x or y or z a particular value, then the equations will be reduced 
to two equations in two unknowns, which we can solve easily. 


For example, if we put z — 0, then we obtain the two equations 

2x + 3y=1 

x+y=2, 

which we can solve to give 

=> y= —3. 
So one solution of the original equations 

2x + 3y—z=1 

x+y=z=2 
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Solution 23.1.3.2 Solution 23.1.3.2 


(i) ((0,0, x3):x3 € R}. Any element in this set is a scalar multiple of 
(0, 0, 1). 

(ii) ((x1, x2):x, + x; = 0 and x, — 2x, = 0) 

The pair of simultaneous equations 


X1 + X5 = 0 
X1 = 2x3 = 0 
has the single solution x, = 0, x; = 0. Thus the kernel is the set {(0, 0)}. 


The dimensions of the kernels are as follows: 
(i) 1. The dimension of the domain is 3, the dimension of its image set 
is 2, and 3 — 2 = 1. 
(ii) 0. The dimension of both the domain and its image set is 2. Note that, 
by defining the dimension of (0) to be zero, we have ensured that (2) is 
satisfied when K = {Q}. [| 
Solution 23.1.3.3 Solution 23.1.3.3 


We need only prove that the elements of the kernel satisfy axioms 1, 
4 and 5. As usual, we denote the kernel by K. 


Axiom I If k, and k, belong to K, we want to prove that k, + k; e K. 
We recognize elements of K by the fact that under L they map to uo, 
the zero element in U. 


Uk, + k;) = L(ki) + L(k;) (Lis a morphism) 
= Mo + Uo (k,,k,€K) 
= Uo (axiom 4 for U) 
Therefore, k; + k, e K. 


Axiom 4 We have already shown (Exercise 23.1.22) that L(vo) = uo. 
Therefore v; e K. 


Axiom 5 Let ke K, then 


L(ak) = aL(k) (L is a morphism) 
= Oo (ke K) 


This last step has been proved in Unit 22, Linear Algebra I, p. 40. The 
verification of axioms 1 and 5 can be amalgamated by using the definition 
of a morphism in the form given in Equation 23.1.2.3. E 


20 


wey 


The Kernel 


The kernel, K, is the set of triples (x, y, z) which map to (0, 0), i.e. which 
satisfy the equations 


2x + 3y—z=0 
x+y-z=0 


Just as before, we can solve these equations by giving x or y or za particular 
value and then trying to solve the resulting two equations in two un- 
knowns; but this time it is not much help, because we simply get the one 
solution, and we want all solutions. But if, for example, we give z a general 
value and put z = k, then these equations become 


2x + 3y =k 
x+y=k, 
which we can solve to give- 
x = 2k, y= —k. 


So one element of the kernel is (2k, — k, k), and by varying k we get all the 
elements. So K is the set 


(Qk, —k, k):k e R}. 


(5,-3,0) 


We want to find the red shaded set in the above diagram. 
We know one element — how do we find the others? 


Any solution of the original equations 
2x+3y—z=1 
x+y-z=2 
is obtained by adding an element of the kernel to (5, — 3, 0). 
So the complete solution set is 
{(5 + 2k, -3 —k, k):ke R}, 


and the theory we have developed assures us that this set contains all 
the possible solutions to the original equations. 


(Check that these are solutions by substituting into the original equations.) 
Notice that we can solve related problems like 


2x + 3y-z=7 
x+ y-— z= 99, 


where the right-hand sides of the equations are changed, very quickly — 
all we need to do is to find one particular solution and then add on each 
of the elements of the (same) kernel. We shall discuss problems of this 
type in considerable detail in Unit 26, Linear Algebra III. E 
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Exercise 1 
By putting x = 0, find a particular solution of the equations 
2x + 3y—z=2 
x+y+z=1 


Find the solution set of the equations. H 


Exercise 2 


We can map the vector space P,, of all polynomial functions of degree 
n or less, to itself by the differentiation operator: 


D:pr-—p (QEF): 


We have already seen that this mapping is a morphism. What is the kernel? 
What significance does this have in integration? [| 


Problems such as the following often arise in mathematics. We are given 
a mapping, M say, from a set A to a set B, and are required to find all the 
elements x of A such that 


M:x— b, 


where b is a given element of B. That is to say, we have to solve the equa- 
tion M(x) = b. Sometimes there is no solution (if b does not belong to 
M(A)); sometimes there is a unique solution; sometimes there are many 
solutions. In a very wide class of problems, A and B are vector spaces and 
M isa morphism. We have just seen that in these cases we can (in principle) 
adopt a standard procedure. We first find the kernel, K, that is, the solu- 
tion set of 


M(x) = 9. 


(This is usually a much easier problem than the original one.) We then 
find any one particular solution of 


M(x) = b, 


and then combine this with each element of K, to get the complete solution 
set. 


We shall apply these techniques later in the course, and we shall also 
meet the idea of a kernel in contexts other than that of a vector space. 
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23.1.5 An Algebra of Morphisms 


We shall now use some of our earlier work on functions. In Unit 1, 
Functions we saw various ways in which functions could be combined. 
In particular, we saw that there are two distinct types of combination: 
composition, or combination by successive performance, and the “arith- 
metic" combinations which can be “induced” from operations defined 
in the codomain. In Unit 1, we gave particular examples of this latter 
type. We saw that when the codomain is a set of real numbers we can use 
the operations of addition, multiplication and so on, to define addition of 
functions, multiplication of functions, and so on. 


If T, and T, are functions mapping a vector space V to a vector space U, 
then we have two vector space operations defined on U which can be 
used to define new functions of V to U: 


T, + Tj: — T,v) + Tv) — (veV) 
aT, :9——9» of T, (v)) (veV) 


where « is a real number. 


Exercise 1 


Show that, if T, and T; are morphisms, then T, + T; and «T, are both 
morphisms. (If you find that you need to look up the solutions to this 
exercise, then do so, but the solutions will have little value if you only 
read them, because they are cluttered with symbols. So once you have 
read them, try writing out at least one part on your own.) [| 


Exercise 2 


Given two vector spaces V and U, we can consider the set of all morphisms 
from V to U. It is a remarkable fact that this set of functions with the 
operations we have just defined is itself a vector space. Prove this state- 
ment. 


(You will need to check each of the axioms of a vector space. They are 
listed on page 11.) [| 


Finally, we shall show* that, if T, is a morphism of the vector space V 
to the vector space U, and T, is a morphism of U to the vector space W, 
then T, » T, is a morphism of V to W. With the usual notation, we have 


Tz ° T,(% 101 + 707) = TAT,(0,0, + a02)) 


T,(a, Ti(01) + o T;(v2)) 
(since T, is a morphism) 


= a, TX(Ti(v1)) + a, TX(T;(v5)) 
(since T; is a morphism) 


= q T, o Tix) + 5T;  T,(v2) 
i.e. Equation 23.1.2.3 holds, so T; » T, is a morphism. 


* What we prove here is a special case of the general result that the composition of two 
morphisms is a morphism. The only reason we prove it here is because we have not proved 
it before in the course: we could have proved it earlier in Unit 3, Operations and Morphisms. 
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Solution 23.1.4.1 Solution 23.1.4.1 
Putting x — 0 in both equations, gives 

3y —z=2 

y+z=1, 

which have the solution 

y-ihiz-i 
Thus one solution is 

x=0y=3iz=1. 
To find the kernel, we have to solve the equations 

2x + 3y —z=0 

x+y+z=0 

If we put x = k, we get 

3y — z = —2k 

y+2=-—k, 

which have the solution 

y = —3k,z = —4k; 
so the kernel is the set 

{(k, —¿k, —4k):ke R}. 
The complete solution is therefore 

{(k, 2 — 3k i — 4k):ke R}. 


(You may recall from Unit 15, Differentiation II that the equations 
2x + 3y — z = 2 and x + y + z = 1 are equations of planes. The set 
of points 


{(k,# — ¿ik i — H0:keR) 


corresponds to the line formed by the intersection of these two planes.) Mi 


Solution 23.1.4.2 Solution 23.1.4.2 


The kernel is the set of all polynomial functions which map to the zero 
function (x —— 0 (x e R)); that is, the set of all constant functions, 


Uf ix—9 k (xe R), ke R}. 

The problem of integration is that of solving equations of the form 
D(p) = f, 

where f is given. 


Since the kernel contains an infinite number of elements, the integration 
problem has an infinite number of solutions. If p is one solution, then the 
set of all solutions is 


{p + f:f:x-——>k (ke R), ke R}. o 
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Solution 1 
Let T= T; + T. 
T(u + v) — (T; + Tu + v) 
= T,u + v) + Tu + v) (definition of T, + T;) 


= Tu) + Ti) + Tu) + TY 
(T, and T, are morphisms) 


= (Ti(u) + T,(u)) + (Tv) + Tilo) (axioms 2 and 3) 
= (T; + T;)(u) + (T; + T,)(v) (definition of T, + T,) 
= T(u) + T(v) 


Thus the first condition for a morphism is satisfied. Now let « be any real 
number. 


T(ou) = (T, + T;)(au) 


= T,(au) + T,(ou) (definition of T, + T;) 

= aTi(u) + aT(u) (T, and T, are morphisms) 

= oT, + T;)(u) (axiom 7 and definition of T, + T3) 
= «T (u) 


Thus the second condition is also satisfied: T is a morphism. (The proof 
could be shortened by using the definition of a morphism given by 
Equation 23.1.2.3, and showing that 


T(a,u; + a5u5) = a, T(ui) + a,T(u,).) 
We also have to prove that «T, is a morphism. We use Equation 23.1.2.3. 
aT, (4,4, + &5u5) = (T (au, + x5u5)) (by definition of aT) 
= g(x, Ti(u1) + 65 T1(u5)) (1; is a morphism) 
= (xo,)T;(u) + (a0,)T,(u,) (axioms 7 and 9) 
= o4(xTi)(u,) + a,(aT,)(u,) (axiom 9) 


as required. > [| 


Solution 2 


Closure under addition and multiplication by a scalar have been proved 
in Exercise 1 (axioms 1 and 5). 


The zero mapping 
0 :v7—— ug (ve V) 
belongs to the set because it is a morphism of V to U (axiom 4). 


The other axioms may be investigated directly from the definition of 
addition of functions and the definition of multiplication of a function 
by a scalar. 


It is important in this proof to be very clear about the equality of two 
functions; two functions are equal if they have the same domain and the 
same image for each element of the domain. [| 
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23.1.6 Summary 


We have concentrated our attention on those mappings of vector spaces 
which are morphisms. Under a morphism the image of a vector space is 
itself a vector space. A particularly important subset of the domain of 
such morphisms is the set of elements which map to the zero vector in the 
codomain. We call this set of elements the kernel of the morphism, and 
it has some remarkable properties. It is itself a vector space; its dimension 
tells us the difference between the dimensions of the domain and the 
image space; it tells us whether the morphism is many-one or one-one; 
it provides us with a method of solving equations such as L(v) — u, 
where L is a morphism of a vector space. If L is a homomorphism then 
it is, by definition, a many-one mapping, and so we can expect this equa- 
tion to have more than one solution. 


We saw that if we know the kernel of the morphism and if we know one 
solution of the equation L(v) = u, then we can easily generate every 
other solution by using the elements of the kernel. We shall see later in 
this course how this method has direct application to the solution of 
problems in the theory of linear equations, in complex numbers and in 
differential equations, and how an analogous method can be developed 
in group theory. We also showed that, given two vector spaces V and U, 
the set of all morphisms from V to U is itself a vector space. Finally, we 
showed that the composition of two (suitably defined) morphisms of 
vector spaces is itself a morphism. 


These results are listed below. 


Let L be a morphism from a vector space V to a vector space U. Then, 


(i) L(V) is a vector subspace of U; 
(ii) K = [ke V: L(k) = uo} is a vector subspace of V, called the kernel 
of the morphism ; 
(iii) (dimension of V) = (dimension of L(V)) + (dimension of K); 
(iv) if v, is a solution of L(v) = (u), then the set of all solutions of this 
equation is (v, + k:ke K}; 
(v) K = {vo} = L is an isomorphism; 
(vi) the set of all morphisms from V to U is a vector space; 
(vii) if L, is a morphism from U to a vector space W, then L, o L isa 
morphism from V to W. 
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23.1.6 


Summary 


zee 


eS 


23.2 MATRICES 


23.2.0 Introduction 


In this second part of the text we introduce a new idea — that of a matrix. 
A matrix is simply a rectangular (or square) array of numbers such as 
you might see on a bus timetable or on the board outside a cinema, 
showing the starting times of the films. Such arrays of numbers arise 
when we try to quantify almost any investigation. 


For example, the following two arrays might represent a survey of the 
voting inclinations in two constituencies. 


Party X Party Y Party Z Don’t Know 


Men 30% 33% 10% 2595 
Women 2594 50% 15% 10% 
Party X Party Y Party Z Don't Know 
Men 50% 40% S335 5% 
Women 40% 40% A 18% 


There are various ways of combining arrays. In this example, adding 
corresponding entries and dividing by 2 to give 


Party X Party Y Party Z Don't Know 


Men 
Women 


40% 337 75% 15% 
324% 45% Bier 14% 


might be a meaningful thing to do, to give a sort of “average” voting 
inclination. 


In the first section of this second part of the text we show how morphisms 
between vector spaces can be represented by matrices. This-correspond- 
ence between mappings and matrices leads us to define some ways of 
combining matrices. We may want to define special combinations for 
special applications, but the ones we define here are the most widely 
used. 


(There is a lot of algebraic and numerical manipulation in section 23.2. 
If you are unfamiliar with matrices, then it will not help you much just 
to read through the calculations. We advise you to work through all the 
calculations as you come to them, and just use the manipulations given 
in the text as a check for your own.) 


23.2.1 Linear Equations 


In the first part of this text we discussed several examples óf mappings 
from R? to R? or R° to R?, and we saw in section 23.1.4 that these mappings 
are connected with simultaneous equations. It is fairly clear that three 
equations, each expressing one of x1, x? and x5 in terms of x,, x; and x3, 
will tell us how to map (x;, x5, x3) to (x1, x5, x4). If these equations hold 
for all real values of x,, x5, x5, then they will define a mapping from R? 
to R°. For example, the three equations 


x, = sin x, 


sin X; 


Agu 
I 


x3 = sin x4 
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23.2 


23.2.0 


Introduction 
x* 


23.2.1 


Discussion 
* * 


Equations (1) 


define such a mapping, as do the equations 
X) =X, + X2 + X3 

X5 = 2x, — X3 

X4 = 2x5 + 3x3 


where the right-hand sides are linear combinations of x,, x, and x,. 


Exercise 1 
Do Equations (1) define a morphism from R? to R?? 


Do Equations (2) define a morphism from R? to R?? H 


It seems that any set of three equations of the form: 
XQ = d4X4 + 42X2 + 43X3 
x3 = b,x, + bx, + b3x; 
X3 = €4,X4 + C2X2 + C3X3 


where the a's, b's and c's are real numbers, defines a morphism from R? 
to R?. A set of equations of this form is called a set of linear equations, and 
our suggestion can be put in more general terms as in the following 
proposition. 


Any set of m linear equations, each expressing one of m variables 
X1, X5,..., Xm in terms of n variables x,,x5,..., x, 

defines a morphism of A" to R”. 

The proposition is easily confirmed, for if the equations are 

X4 = d4X4, + 02X2 + ++: + a,x, 


x5 = b,x, SE b,x> ap Ooo = b, X, 


X = q4X, + Q2X2 + :-- + q,X, 
they certainly define a mapping T of R" to R": 
dO: 15965 5996) (XS Ky ges XL). 


For the variables x,,x,,...,x, can each take any real value, and so 
(X1, X5,..., Xn) can be any element of A". We have to prove that T is a 
morphism, i.e. that 


DOE 0x5. x.) ecu Tx, Vaio vex X) 
where « is any real number, and 
T(x + Yi X2 Yao... XQ + Yn) 
= Phu e) F TY Y) ++ a) 


Exercise 2 


Prove that T is a morphism by proving the last two statements in the 
text. E 


We thus have the result that every set of linear equations of the form of 
Equations (4) defines a morphism. 


This in itself would not be so interesting, if it were not for the fact that we 
can represent any morphism from R” to R” by such a set of equations. 
We can do this as follows. 
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Equations (2) 


Exercise 1 
(2 minutes) 


Discussion 


Equations (3) 


Definition 1 


Theorem 
*o* o* 


Equations (4) 


Exercise 2 
(2 minutes) 


Discussion 
*o* 
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We take as a basis for R” the set of n vectors: Discussion 
e3552«(1, 0:00:30. — 0); 
e, = (0, 1,0,0,..., 0), 
ez = (0,0, 1,.0,..., 0), 


€, = (0,0, 0,0,..., 1). 
An arbitrary element of A" can be written: 
X = (X1, X3, X3,..., Xn) = X1€1 + X365 + X363 +--+ + Xp 
If T is a morphism from A" to R”, then 
TG = Ties + x223 + +++ + xe) = x,T(81) +--+ + x,T(e,). 


We see that the base vectors tell us the whole story. Each of the base 
vectors has an image in A"; suppose 


T(e,) = (a1, b,.....41) 
T(e;) E (az, b5,.... 5) 


MEN e pot) 
Then 


T(x) Llais D3, .... d4) 
+ xx(a5, b5,....q) 


Ea Dai) 
= (dX; + 07X7 + +++ + aux, bx, + +--+ b,x,,...): 
If 
Tx) Dll Or oe ees 
then 
T(x) = xe,  X5e5 +--+ + X nem 


(The basis (e,,€2,..., em} for R” is shown on page 41, written in terms 
of column vectors, instead of row vectors as we have here.) 


We have 
Xi = A,X, + 02X2 + --- + aX, 


X5 = Pixs + D536; mE ERO c DX, Equations (5) 


Xm dX, + q2X2 bo + ux, 
Thus, the morphism can be represented by Equations (5). 


There is a one-one correspondence between sets of linear equations and Theorem 
morphisms from A" to R”. Any set of linear equations of the form (5) p 
defines a morphism from R” to R”, and any morphism from R” to R” 

defines a set of linear equations of the form (5). 


We mentioned earlier that a morphism between vector spaces is often Discussion 
called a linear transformation. We now see why — the transformation aep 
refers to the mapping part, and linear refers to the properties we have 

just discussed. It is from here also that we get the term Linear Algebra. 


In Unit 26, Linear Algebra III, we shall take up the topic of sets of linear 
equations in much more detail. In the meantime we shall use the ideas 
we have just been discussing to introduce the topic of matrices, which is 
of great importance in Linear Algebra and mathematics generally, and 
which we shall use when we discuss linear equations in detail. 
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Solution 1 Solution 1 
Let x = (x1, x5, x3) and y = (y1, Y2, Y3). 
For Equations (1) we have the mapping S, where 
S(x) = (sin x,, sin x, sin x3) 
S(y) = (sin y, , sin yz, sin y3) 
S(x + y) = (sin (x, + yı), sin (x; + y2), sin (x3 + y3)) 
and 
S(x) + S(y) = (sin x, + sin y,,sin x; + sin y,, sin x, + sin y3) 


and so 
S(x + y) # Six) + Sy). 


For Equations (2) we have the mapping T, where 


T(x) = (x1 + X2 + x3,2x, — X2, 2x2 + 3x3) 


and 
T(x + y) = X1-+ yy + Xz + y2 + X3 + Y3, 
2(x, yi) — (x2 + yə), 
2(x + y3) + 3x3 + y3) 
= T(x) + T(y) 
Similarly 
T(ax) = aT (x) 


. where a is any real number. 


Thus Equations (2) define a morphism, but Equations (1) do not. E 


Solution 2 Solution 2 
The first statement follows from the fact that, for example, 


a,(xx,) + a5(ax5) + --- + a,(ax,,) 
= o(aix, + 43X3 + --- + a,x,). 
a process which can be applied to each of the equations. 
Similarly, we have, for example, 
ai(xı + yi) + a(x + y2) +++: + ax, + Yn) 
(ax; + d5X; + +++ + a4X,) + (01); + a3y + ++- + App) 


The second statement follows from an application of this process to each 


component of T(x, + y,,..., x, + y, [| 

23.2.2 Arrays of Numbers 2322 
We have seen that Equations 23.2.1.5 define a mapping from R” to R” Notation 
(given bases for R” and R”). Now (x,,x»,..., x,) and (x1, x5,. Im) de 


are simply arbitrary elements in the respective vector spaces ; the equations 
themselves are adequately specified by the array of numbers 


ay az ... a, 
bb uds 
dí 42 = dmg 
Such an array of numbers is called a matrix. Definition 1 


kk 
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Often we want to talk about this array as an entity in itself, as opposed to 
a collection of elements. To do this we either put parentheses round the 
array or refer to it by a single letter. Sometimes we do both, thus we write 


ay dot 7 ds — €— 


A or br SEI m rows 


dido -— Hw... E 
jai 1 
— n columns 
Thus, a matrix with m rows and n columns defines a mapping from AR" 
to R" (the bases being given or understood), and the mapping is generally 
referred to by the name of the matrix. 
Example 1 Example 1 
(i) The mapping of R? to R? defined by 
x’ = 2x + 3y 
y =3x-— y 
is represented by the matrix 
2 3 
=, 
(ii) The matrix 
12-3 
Een 
represents the mapping of R? to R? defined by 
Xx — X, + 2X; + 3X; 
x = 4x, + 0x, + 6x5 | 
Two matrices are said to be equal if they represent the same mapping Definition 2 
(with respect to bases which are given or understood). Thus matrices A 
and B are equal if they have the same number of rows and columns (they 
must represent mappings with the same domain and codomain), and the 


corresponding elements are equal. If A and B are equal, we write A = B. 
For example, 
> 
3 4 


12 
EN 
Se 3 3 4 
io e osa 
12| 124 

6 d al 

A A 1-—2— 3—4 
LOW. Se 


p 
0000 


Since a matrix is an alternative way of specifying a morphism, we should Discussion 
beable to tell something about the morphism just by looking at the matrix. E 
Investigating the properties of matrices is equivalent to investigating 

morphisms between vector spaces. 
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23.2.3 Combining Matrices 23.2.3 


We can represent a set of equations by a matrix, but we can also go one Discussion 
better and rewrite the equations in terms of this matrix. Consider, for 
example, the equations 


Xj 3x, + 2x, + Ix, 
Equations (1) 
x, = 1x, + 1x, + 3x, 


They specify a mapping from R? to R? under which 
(x, > X2, X3)— (x1 , x3), 


and are represented by the matrix 
[ 2 ') 
A= 
a E 
We can write the equations in the following way: 


(x4 > x2) = A(x, > X5, X3), 


but there are plenty of other ways in which we could write them. We shall 
choose a way which leads to results consistent with the existing literature 
on matrices. We write 


x 
(x4, x4) as M 


2 
and 
X; 
(x1; X2; x3) as | x; |; 
X3 
that is, we write the elements of R? and R? as arrays (or lists) of numbers, 
rather than in co-ordinate form. These lists are just special kinds of 


matrices, matrices with one column. So we can rewrite the equations in 
matrix notation as 


, Xi 
1 
| ‘Jean X3]. Equation (2) 
x2 
X3 
where [] is a combination of matrices which we have to define to make 
Equation (2) equivalent to Equations (1). 
X; 
From our definition of equality of matrices, A [Q] X | must be a matrix 
Xai 
with one column and two rows, and the first element must be x, and 
the second x5. But x, and x are given by Equations (1), so 
x, 
A E X5 == | 


X3 


3x, + 2x, + 2] 
Net Xa Ss 


Thus 


nl 3x, + 2x, + 1x4 
Ox] = 
lx, + 1x, + 3x, 
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and for a general mapping from R? to R?: 


ESE 


| E " + bB + a 
d ef e 


da + ef + fy 


The first element in the third matrix comes from the first row of the first 
matrix, and is obtained by multiplying each element of that row with the 
corresponding element in the second matrix and adding together the 
products thus formed. The second element comes from the second row 
of the first matrix by applying the same procedure. 


Similarly, 


| 1 
| Es `) | | Uxdi2x(-2 43904 
O |-2]= 
Ex (—2)x 1 +3 x (-2 + (-4) x3 


sia 


We can extend this operation to deal with longer lists and matrices with 
more rows and columns. This extension corresponds to manipulating 
equations specifying mappings from R" to R", where the column corres- 
ponding to an element of R" has n elements 


261 a a 3 
X2 bi b, n 

, and the left-hand matrix is | - - - m rows 
Xn 41 42 PU Qn 


This matrix must, of course, have the same number of columns as there 
are elements in the list matrix corresponding to an element of R", because 
of the form of the linear equations. 


Example 1 Example 1 
Gy ess 1 1x1+2x(-1)+3x2 5 

4 5 60 |-1|=|4x1+5x(-D+6x2]|=|11 

1589 2 7x1+8x(-1)+9x2 17 
(1/1 2 E 23 

34 E 

\5 6 183 


(iii) The set of equations 


2 3 — 2 
Le — 
3xj + Ix, =4 


can be written in the form 


2, 3 2 
Xi 
1 -—210 -xr3 
X5 
3 1 4 a 
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Exercise 1 Exercise 1 
Q minutes) 
Calculate 
ji —l 
lodo] e! ? Jal: 
ii y 
: —2 —1 0 4 3 
3 4 E 


At the end of section 23.1.5, we proved that the composition of two Discussion 
morphisms, their sum, and the scalar multiple of a morphism are all 
morphisms. It follows that we can define combinations of matrices 
corresponding to each of these combinations of morphisms. Here we shall 
consider the combination which corresponds to composition. We shall 
consider the other cases in section 23.2.5. 


Consider, for example, the morphisms T, and T, of R? to R? defined by 


allia a eont kama Lc Ac ti 


T, :(%1, x3)—— (1x, + 1x4, 2x, + 1x5) 
Di, x3)—— (1x, + 2x5, 3x, + 2x2) 


Under the composite mapping T, o T, we have 
~The Tx, x3). TT, 6G. x2)) 
and 
TT (1, x3)) = Thx, + 1x2, 2x, + 1x2) 
= (I(x, + x2) + 22x, + x2), Xx, + xj) 
+ 22x, + x>)) 
= (5x, + 3x2, 7x, + 5x2) 
Writing 
Tq, Xs) —641.3) and E6045) 995 x), 
we have: — = 
x, = Ix, +1 
A E. d Equations (3) 
x3 = 2xi £ 1x; 


and 


x 
= ix, £22, 2 Equations (4) 
Substituting from Equations (3) into Equations (4), we get 
xi = 5x, + 3x, 
xf = Tx, + 5x, EPRD 
as before. 


In terms of matrices, the matrix representation for T, is 


ES 
A, = 
2 1 
(see Equations (3)), and the matrix representation for T, is 
| E ( 3) 
IER 


(see Equations (4)). And we have shown that the matrix representation for 
Do T, is 


(see Equations (5)). 


If we use the symbol * to represent the operation of combining matrices 
which corresponds to the composition of mappings, we have 


A,* A, =A, 


We now know how to combine A, and A, in this case, but how can we 
work out B * A for any matrices B and A? Indeed, does B * A make sense 
for any matrices B and A? 


First of all, how do we calculate B * A? Looking at Equation (6), we see 
that the element in the first row and first column of A, is 5, and 


sedes us 


and this expression is obtained by multiplying the elements in the first 
row of A, by the corresponding elements in the first column of A, and 
adding together the products. 


Ede de: 


Combining rows of A, with columns of A, in this way, we get the other 
elements of A,. 


eH 
j * 
3 3 \2 


HM 4 


In the general case of matrices with two rows and two columns, we define * 


by 

[ 4 { , E da + fe eb ed 

g h c d ga t hc gb 4 hd, 
Exercise 2 
If 

Xi = ax, + bx; 

Xa = cx, + dx, (1) 
and 

A = ex. fx, 

" (ii) 


x3 = gx, hx; 


find x1 and x5 in terms of x, and x,, and compare your answer with the 
definition of *. | 


35 


FM 23.2.3 


Equation (6) 


Definition 1 


Exercise 2 
(2 minutes) 


(continued on page 36) 


L 


Solution 1 
(i) [5|], because 1 x 3 + (—1) x (—2) = 5 
6| and 2x 3+0 x (-2) =6 
1} and 3x 3+4-x (-2) = 
(ii) {14\, because 1 x 1+2x24+3x3 I 
ES (-1)x14+0x24+4x3=11 E 
Solution 2 


Substituting from (i) into (ii), we get 
xi = e(ax, + bx;) + f(cx, + dxj) 
X5 = g(ax, + bx) + h(cx, + dx;) 
i.e. 
x, = (ea + fc)x, + (eb + fd)x, 
x, = (ga + he)x, + (gb + hd)x, 
and the matrix representing these equations is 


ea+ fc eb 4 fd 
ga+ hc gb + hd m 


(continued from page 35) 


We can extend the definition of * to more general matrices. So far we have 
combined matrices with two rows and two columns — corresponding to 
combining a mapping from R? to R? with another mapping from R? to R?. 
It makes sense to combine mappings other than these; a morphism T, 
from R" to R?, say, can be followed by a morphism T, from R? to R”, say. 


We know that T, will be represented by a matrix with m columns and p 
rows, and T, will be represented by a matrix with p columns and n rows. 
So we can extend the definition of * to form B * A for all sorts of matrices 
provided the number of columns in B is the same as the number of rows in A. 
Since T, » T, isa morphism from R” to R”, the combined matrix will have n 
rows and m columns, and in general B * A will be a matrix with the same 
number of rows as B and the same number of columns as A. 


m m 
columns columns columns 


n p n 
rows TOWS TOWS 


We shall not follow through all the details, but illustrate how we might 
proceed in a simple example. 
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Solution 1 


Solution 2 


Discussion 
E 


Example 2 
Tu i 
IfB=|3 4| and A= | |, 
5 6 4 


how do we calculate B * A? A represents the mapping from R! to R? 
specified by the equations 


, 


xy = 1x; 
SA 
and B represents the mapping from R? to R? specified by the equations 
xi = lxi 2x, 
x3 = 3x1 4x; 
x3 = 3x1 + 6x5 


Substituting Equations (7) into Equations (8), we find that the composite 
mapping has matrix 


5 
BeA=|11 
17 
This is a matrix with three rows and one column ; we expect this because the 


result of mapping R* to R? (matrix A) and then R? to R? (matrix B) is a 
mapping from R* to R? (matrix B * A). 


B*A 


The elements in the matrix B* A are calculated by exactly the same 
method as before : the element 5 comes from the first row of B and the first 
(and only) column of 4:5 = 1 x 1 +2 x 2. The element 11 comes from 
the second row of B and the first (and only) column of A:11 =3 x 1+4 x 2. 
And17—25x14 6 x2. [ | 


If you look back at page 32 where we discussed the operation Q, you will 
see that [ ] is just a special case of *. We can define * for general matrices 
(to correspond to the composition of morphisms) as follows. 


The combination B * A is defined for any matrices B and A, provided that 
the number of columns in B is the same as the number of rows in A. If 


B*A-C, 


then the element in row i and column j of C is obtained from row i of B 
and column j of A ;itis calculated by adding together the termwise products 
of the elements in this row and column. 
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Example 2 


Equations (7) 


Equations (8) 


Definition 2 


FM 23.2.3 


Ap 


column 
J 


By, a, + B242 + B323 + --- + Ba, 
If B has m rows and p columns and A has p rows and n columns, then 
C = B* A will have m rows and n columns. 
Example 3 Example 3 


(Make sure you can obtain these results for yourself.) 


(i) 1 —2 1 3 —4 3 7 —6 
—1 3 li —2 d. —4 -9 7 
2 4 —2 -2 -4 
(i e-2-—3—4 5) eS 
1 
1—|5— (15) 
1 
1 
(iii) ] -2 1 3 —4 
= 3 A E 1 | is not defined, because 
2 4 1 2 3 
the number of columns in the left-hand matrix is not the same as the number 
of rows in the right-hand matrix. E 
Exercise 3 Exercise 3 
(3 minutes) 
(1) Calculate | 1 4 : b = |J 
—1 3 2 1 
(11) Calculate 1 3 5 1 1 
= Xo j : 2 2 
0 = 


(iii) Is * (a) commutative? 
(b) associative? mE 
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23.2.4 Some Special Matrices 23.2.4 


Rotation Matrices Discussion 


In the first part of this text we had several examples of morphisms specified 
by sets of equations, and we discussed their geometric interpretation. For 
example, we saw that the equations 


xi = —X, = Ox, + (—1)x; 
x= x=1lx+ 40x, 
can be regarded as specifying a mapping which rotates the plane through 
an angle a anti-clockwise about the origin. In terms of matrices, we can 
say that E = represents a rotation of the plane through ; anti- 


clockwise. 


Suppose we start the other way round and look for matrices to do some 
specified geometric job. 


Example 1 Example 1 


Consider a mapping which rotates the plane about the origin through an 
angle 0 anti-clockwise. 


X2 


(4, x2) 


If we call the corresponding matrix R,, then 


where x,, x5, x; and x5 are related as in the diagram. We can find Rọ easily 


1| [0 
by considering the basis i. NI for R?. 


A little trigonometry tells us that (See RB2) 
(0) es j 
> 
0 sin 0 


a 
X2 X2 


P'(cos 0,sin0) 


P (1,0) | > ^ 


X41 x4 
(continued on page 40) 


39 


Solution 23.2.3.3 

(i) {7 1 
im 

(ii) [7 2 
i> 


(iii) (a) It is easy enough to find a counter-example to show that * is not 
commutative. For example, 


[EIN 
AAA 


(b) This is more difficult, because * is in fact, associative. 


A direct proof for * could be tricky, but we can go about it another 
way. Remembering that * for matrices corresponds to composition 
of morphisms, which are just special kinds of functions, we can 
show that the composition of functions (where it is possible) is 
associative. Suppose A, B, C, D are sets and f, g, h are functions such 
that 


fA AB BEBA E CRD 


then if a e A, 
((heg)e f)(a) = (he g)Cf(a)) 
= h(g(f(a)) 
= hí(g > f)(a)) 
| = (ho (g ° f))(a) 
Thus, composition of functions is associative, which implies that 
* is also associative. m 


(continued from page 39) 


, | —sin 0 | 
and ===> 
1 cos 0 


X2 x2 
i Q(0;1) (-sin9, cos 8) Q’ 
| = NY 
| 3 
X4 x4 


R, has two rows and two columns because the mapping is one-one and 
the image set of R? is R?. Thus if 


T 
Ae dE 


we know that 


E dd = a 
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Solution 23.2.3.3 


and 
[ ^ N | —sin s 
* = 
cd 1 cos 0 
Since 
PLE 
* = 
ed 0 c 
and 
C at)-0 
* = 
Ca 1 d 
we find that 
a= cos, c= sin, b= —sin@andd = cos 0. 
Thus 


E 0 —sin 4 
sin 0 cos 0 


You may have noticed in this example that the columns of R, are precisely 
1 0 
the images of the base vectors | a and i under the mapping. This is not 


special to this example, for it is easily seen that if A is the matrix of a 
morphism T from R”, then the columns of A are precisely 
Ters 125 Tete 


Em> 


where €,,€,,€3,..., €, are the particular base vectors 


1 0 o 

0 1 0 

ey = 0 ,€2 == 0 > ->€m q 0 
(0 0. 1 


(Compare this with the discussion on page 29.) 


The subject of Linear Algebra is full of elegant yet simple results like this. 


Exercise 1 


Find the matrix which corresponds to a “stretching of the plane”, so that 
the distance of every point from the origin is multiplied by a number k. MW 


Exercise 2 


Find the matrix which corresponds to the identity mapping, which leaves 
every point in the plane unchanged. [iz] 


The Identity Matrix 


Under the operation of composition of functions, the identity function 
leaves every function unchanged. For example, the function: 


J -xx (xe R) 


is such that f o g = go f = g for any function g with domain and codomain 
R. This property has its counterpart in matrix algebra. If we write 


0 
E= l iP then I * A = A* I = A for any matrix with two rows and 
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Exercise 1 
(1 minute) 


Exercise 2 
(1 minute) 


(continued on page 42) 


Solution 1 


: 1 k 0 0 a = de 0 
Since xL and ——— | |, the matrix required is . B 
0 0 1 k 0 k 


Solution 2 
=d- 0 
The matrix is 5 
0 1 


(The special case of Solution 1 when k — 1.) | 


(continued from page 41) 


two columns. Indeed, if we write Im,m for the matrix with m rows and m 
columns, such that every entry is zero except for 1’s as the diagonal entries 
from the top left-hand corner to the bottom right-hand corner, then if A is 
any matrix with m rows and n columns, we have 


Lam A= A*I,,—4A 
Usually, the suffices are dropped, and we just write I for the identity 
matrix appropriate to the particular situation. 
Exercise 3 (Optional) 


You may find the following interesting. 


(i) What is the result of rotating the plane anti-clockwise about the origin 
through an angle a and then through an angle f? 
(ii) Complete the following equation: 


(iii) Complete the following equation 


sie [e 
E ue. oy 


(iv) Write down R, and R, and calculate A, * R, directly. 
(v) Compare your answers to (iii) and (iv). Can you deduce anything from 
this comparison? [| 


Bora = 
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Solution 1 


Solution 2 


Identity 
Matrix 


kk 


Exercise 3 
(3 minutes) 


23.2.5 More Ways of Combining Matrices 


The operation * arose from our interpretation of the composition of 
morphisms in terms of matrices, and was made possible because we had 
established the correspondence between morphisms and matrices. 


There is an interesting result concerning morphisms which we obtained 
towards the end of section 23.1.5, and which we now want to take up again. 
We showed that the set of all morphisms from a given vector space V to a 
given vector space U itself formsa vector space. We know that, for example, 
any morphism from A" to R" is represented by a matrix with n rows and m 
columns, and conversely, that any such matrix represents a morphism 
from A" to R”. So the set of all matrices with n rows and m columns forms 
a vector space (for given n and m) for operations corresponding to those 
for morphisms, i.e. addition of functions and multiplication of a function 
by a number. Itis fairly easy to see that the corresponding matrix operations 
are as follows: 


(i) If k is any real number, then kA is the matrix obtained by multiplying 
each element of A by k. 

(ii) If A and B are two matrices each with n rows and m columns, then 
A A B is the matrix with n rows and m columns obtained by adding 
corresponding elements of A and B. 


Example 1 
(i) 3| 1 2 3 6 
—1 3| = |-3 9 
4 7 12— 21 
(ii) 1 2 3 6 l +3 2 4+6 
—1 3 JA |-3 9 |=|(-1) + (—3) 3+ 9 
4 7 P= N 4 +12 A 
4 8 
= |-=4 12 
16 28 


(iii) 


1 a = [ —3 = 
—1 3 6 pee 
is not defined: it is equivalent to trying to “add” two functions with 
different domains. pa 


It is clear from the definition of A that it is both commutative and 
associative. Because of its obvious similarities to addition, we use the 
symbol + for A; for example, we write 


[ j | 2 E | 4 = 
+ = 

1 4 —2 —4 —1 0 

and we call the operation matrix addition. 


We can use these two operations to define a third operation — matrix 
subtraction — by 


A=B=A+(-1)B 
The matrix ( — 1)B is usually written as — B. 


So now we have three operations: *, multiplication by a scalar, and +. 
The two operations + and * are binary operations on sets of matrices, 
although we have to take some care over defining the sets in which they 
operate. We have noted some of their properties: + is commutative and 
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23.2.5 


Discussion 
wo 


Definition 1 
wkk 


Definition 2 
wkt 


Example 1 


Discussion 
** 


Definition 3 
*o* ox* 


Definition 4 
*o* o* 


(continued on page 44) 


Solution 23.2.4.3 


If we perform two rotations such as these successively, the result is a 
rotation through an angle f + a, which is of course the same as a rotation 
through a + f. Hence 
(i) A rotation through f + a. 
(11) Rg * Ra = Rara 
(iii) cos(f + à) —sin(B + > 
sin (f + a) cos (f + a) 


Bra = 


cos —sin cosa  —sin« 
Gv) &*&, = | 5 p -| | 


‘sin f cos f sin « cos Y 


E PB cos a — sin f sin a  —cos f sin a — sin f cos > 


sin $ cos x + cos P sina  —sin £ sin a + cos f cos « 
(v) Comparing (iii) and (iv), we obtain the formulas 


cos ($ + a) = cos f cos « — sin f sin « 


sin (f + a) = sin f cos a + cos f sin a E 


(continued from page 43) 


associative; * is associative but not commutative. We have not as yet 
seen how * and + interact. In particular, are they distributive over each 
other? 


Exercise 1 


By choosing some simple matrices, prove that + is not distributive over *, 
ie. that for some suitable* matrices A, B and C 


A 4 (B* C) z (A + B)*(A 4 C) [| 


It is a relatively simple matter to disprove a conjecture by a counter- 
example, as in Exercise 1, but to prove a conjecture we have to argue in 
general terms. We shall not do this here, because the proof does not 
illustrate anything very important (except, perhaps, a need for an improved 
notation, which will be developed later); however, we record the fact 
that * is distributive over +. That is, for suitable matrices A, B and C, 


A*(B + C) = (A* B) + (A* C) 
and 
(A + B)*C —(A*C) + (B* C) 
Notice that we must make both statements because * is not commutative. 


There is one difficulty that we must mention; it concerns terminology. 
It is widespread practice to refer to the operation * as matrix multiplication 
and drop the symbol *, writing AB instead of A * B. This is most unfor- 
tunate, for although * is associative, and distributive over +, just as for x 
and + in the set of real numbers, * is not commutative, and whereas + 
comes from addition of mappings, * comes not from multiplication but 
from composition. 


However, because the practice is widespread we shall refer to the operation 
as matrix multiplication for the remainder of the course, and drop the 
symbol *. 


In the next section we collect together some of the differences between 
matrices and numbers, and also some of the similarities of the two algebras. 


* By "suitable" we mean matrices with appropriate numbers of rows and columns, so that 
the operations are defined. 
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Solution 23.2.4.3 


Exercise 1 
(2 minutes) 


Discussion 
*o* 


23.2.6 Matrix Algebra and the Algebra of Numbers 


A major difference between matrices and numbers is that, whereas we can 
add or multiply any two numbers, this is not the case for matrices. If we 
restrict ourselves to a set of matrices with the same (fixed) number of rows 
as columns, then we can add or multiply any two matrices from the set. 
We take it as implicit, then, in this section that all the matrices have the 
same number of rows as columns, and that this number is the same for all 
the matrices. (A matrix with the same number of rows as columns is called 
a square matrix.) 
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23.2.6 
Summary 


** 


Similarities 
Numbers : Matrices 

Addition is commutative. a+b=b+a A+B=B+A 

Addition is associative. (a+ b+c=a+(b+c) (A+ B+C=A+(B+C) 

There is a zero element. a+0=0+a=a A+0=0 + A= A, where O 
is the matrix in which every 
entry is zero. 

Each element has-an (—b)+b=0 (—B)+B=0 

additive inverse. 

Multiplication is associative. (ab)c = a(bc) (AB)C = A(BC) 

There is an “identity” al =la=a AI = IA = A, where I is the 

element. identity matrix defined on 
page 42 

“Multiplication” is a(b + c) = ab + ac A(B + C) = AB + AC 


distributive over addition. (a + b)c = ac + bc 


(A + B)C = AC + BC 


Certain product rules hold. a0 = 0a =0 AO —0A-—0 
a(—b) = —ab A(—B) = —AB 
(—a)(—b) = ab (—A)(—B) = AB 


Differences 


(i) Number multiplication is commutative ; matrix multiplication is not. 
That is, there exist matrices A, B such that AB z BA. 


As a consequence, the matrix expansion (4 + B)? = A? + 2AB + E? 
is false, in general. The correct expansion is (A + BP = 
A? + AB + BA + B?. The matrix AB + BA is equal to 2AB if and 
only if A and B commute, i.e. AB = BA. This may happen in certain 
cases (for example if B = I), but it is not generally true. 


(ii) The product of two non-zero numbers is non-zero, but the product of 
two non-zero matrices may be equal to the zero matrix. 


For example, if 


1 1 1 1 
a=] jan = ) 
1 1 =1 -1 


then AB = O. (Note also that BA £ O.) This example also illustrates 
the existence of a non-zero matrix B such that B? = O. 
(iii) The cancellation law holds for numbers, but not for matrices. 


That is, if a, b, c are numbers and a 4 0, then 
(ab = ac) > (b = c) 

But for matrices, 
(AB = AC) + (B= C). 


For example, if A and B are as above, then AB = AO, but B is not 
equal to the zero matrix, i.e. A cannot be cancelled. 
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Solution 23.2.5.1 Solution 23.2.5.1 


For example, with 


we have 
E Im y 
+ * = 
3 4 3 —1 3 —2 3 9 
whereas 
Caa ioe all 
SE * SE 
3 4 3 —1 3 4 3 —2 
-[5 i 
«0 o4 E 
(continued from page 45) 


(iv) The equation x? — 0 has the unique solution x — 0, but the matrix 
equation X? = O has an infinite number of solutions. 


It can be shown that the general solution of the matrix equation 
X? — O for matrices with two rows and columns is 


s = 
AC for arbitrary r and non-zero s, 


or 
0 0 : 
X= for arbitrary t. 
t 0 
(v) The equation x? = —1 has no solution (in real numbers), but the 
matrix equation X? = —] does have solutions. 


We single out two solutions for special attention. These are the 
matrices 


0 —1 0 1 
C= and —C = 
1 0 =1 0 
Note that the matrix — I is equal to the rotation matrix R, correspond- 


ing to a rotation about the origin through the angle z. The matrix C 
is the rotation matrix R,,. What transformation does — C correspond 


to? 
Exercise 1 Exercise 1 
: : - - (2 minutes) 
Give expansions of (A — B)? and (A — B)? which are valid for all square 
matrices. (There are 8 terms in the second expansion.) || 
Exercise 2 Exercise 2 
(2 minutes) 


The equation x? = x has precisely two solutions, x = 0 or 1. By giving an 
example of a matrix with 2 rows and 2 columns, show that the matrix 
equation X? = X has solutions other than X = O or I. | 
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23.3 APPENDIX 


The “Dimension” Theorem 

This Appendix is optional. 

THEOREM 

If L isa morphism from a vector space V to a vector space U, then 


(dimension of L(V)) + (dimension of kernel) = (dimension of V) 


PROOF 


Let (u; 5, U3, ..., Up} bea basis for L(V) (ie. let L(V) have dimension p). 
and let (k, , kz, k,, ..., km} bea basis for the kernel (i.e. let the kernel have 
dimension m). Since each of the u’s belongs to L(V), there must be vectors 
in V which map to them. Suppose 


If we can show that the set of vectors B — icio do O v,} form 
a basis for V, then the theorem is proved, for this will show that V has 
dimension m + p. 


First of all, we show that any vector in V can be expressed in terms of these 
vectors, and then we show that B is a linearly independent set. 


(i) Let y be any element of V; then we can find o;'s (real numbers) such that 
L(v) = 04, + AU +--+ ApU y 


= 4/1(v,) + oo L(v;) + --- + o, L(v;) 


Le. 
L(v) = L(x,v, + 0505 +--+ + au) 
i.e. 
Lv) — Lav, + 030) +--+ + QU) = Uo 
Le. 
LY — 00, — 070) — &3U5 — ="* — App) = ug 
Thus (p — 0,0, — av, —--- — %,U,) is in the kernel of L. There exist, 


therefore, numbers f,,..., Bm such that 

U— 0407 — 0505 — --- — av, = Piki + Bok, +- + Paka 
Le. 

Je Bik, + P2k2 Por Bk, + OU, ae 20) E S. 


Thus any vector in V can be expressed as a linear combination of the 
vectors ki, k;,..., kms ,,..., v,, ie. B spans V. 
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(continued on page 48) 


Solution 23.2.6.1 
(A— BP = A? —AB— BA + E? 
(A — BP = 4? — A?B — ABA + AB? — BA? + BAB + B?A— B? 


E 
Solution 23.2.6.2 
0 0|. 
x= is an example. 
0 1 a 


(continued from page 47) 


(ii) We now show that B is a linearly independent set. Consider the 
expression 


By ky + Bok +++ PmEm + 0101 + 0707 +=" +00, vo (1) 


Then 

L(Bik, + Bok +--+ + &v,) = uo 
ie. 

B1L(k;) + B5L(k;) + --- + o,L(v,) = uo 
i.e. 


a, L(v,) + 0,1403) + --- + e, L(v) = uo, 


because all the k’s are in the kernel and so L(k;) = uy. But 


L(v,) = u, 
L(v;) = u; 
L(v,) — u, 


and the u’s have been chosen to be linearly independent. Thus, all the 
a’s are zero, and expression (1) reduces to 


Biki + Boks too Duk, =a Vo 


But the k’s are also linearly independent, and so the fi's are zero. 
Thus (1) implies that all the «’s and f’s are zero, and so the set is linearly 
independent, as required. 
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Solution 23.2.6.1 


Solution 23.2.6.2 


Unit No. 


NO 00-105 tA UNE 


NO TEXT 


NO TEXT 


NO TEXT 


NO TEXT 


Title of Text 


Functions 

Errors and Accuracy 
Operations and Morphisms 
Finite Differences 


Inequalities 

Sequences and Limits I 
Computing I 
Integration I 


Logic I — Boolean Algebra 
Differentiation I 
Integration II 

Sequences and Limits II 
Differentiation II 
Probability and Statistics I 
Logic II — Proof 
Probability and Statistics II 
Relations 

Computing II 

Probability and Statistics III 
Linear Algebra I 

Linear Algebra II 
Differential Equations I 


Linear Algebra III 
Complex Numbers I 
Linear Algebra IV 
Complex Numbers II 
Groups I 

Differential Equations II 


Groups II 

Number Systems 
Topology 

Mathematical Structures 
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